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Point Estimators for Mean and Variance 


8.2.2 Point Estimators for Mean and Variance 


The above discussion suggests the sample mean, X, is often a reasonable point estimator for 
the mean. Now, suppose that we would like to estimate the variance of a distribution a 2 . 
Assuming 0 < cr“ < oo, by definition 


a 2 = E[(X- l u) 2 ]. 

Thus, the variance itself is the mean of the random variable Y = (X- ju) 2 . This suggests the 
following estimator for the variance 


n 

°=zY( x k -tf- 

n k= 1 

By linearity of expectation, a is an unbiased estimator of a . Also, by the weak law of large 

-2 ? 

numbers, a is also a consistent estimator of a . However, in practice we often do not know 
the value of ju. Thus, we may replace // by our estimate of the ju, the sample mean, to obtain the 
following estimator for a 2 : 


2 
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2 2> t --n 2 

n k= 1 


Using a little algebra, you can show that 


-2 , n 

S = ~ Z X l-nX 

n k= 1 


Example 8.5 

Let Xy,X 2 , X 3 , . .., X n be a random sample with mean EX i = //, and variance Var(A ; ) = a 2 . 
Suppose that we use 
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to estimate cr 2 . Find the bias of this estimator 

"2 ~2 

B(S ) = E[S ] - cr 2 . 


• Solution 

o First note that 


2 

EX = (£T) 2 + Xar(X) 



Thus, 


Therefore, 
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E[S] = - 
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cr 2 . 
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5(5 ) = £[5 ] - rr 2 
.2 


cr 

n 


2 

We conclude that S is a biased estimator of the variance. Nevertheless, note that if n is 

2 n -1 - 

relatively large, the bias is very small. Since fi[5 ] = -cr , we can obtain an unbiased 

2 2 n 

estimator of cr by multiplying S by —j-. Thus, we define 
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A u— i 
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By the above discussion, S 2 is an unbiased estimator of the variance. We call it the sample 

~2 

variance. We should note that if n is large, the difference between S 2 and S is very small. We 
also define the sample standard deviation as 


S 2 . 


Although the sample standard deviation is usually used as an estimator for the standard 
deviation, it is a biased estimator. To see this, note that S is random, so Va r(S) > 0. Thus, 

0 < Var (S) = ES 2 - (. ES ) 2 
= a 2 - (ES) 2 . 

Therefore, ES < a, which means that .S is a biased estimator of a. 

Let X|, X 2 , X 3 ,.. ., X n be a random sample with mean EX- = /u < oo, and variance 

0 < Var(X ; ) = cr 2 < oo. The sample variance of this random sample is defined as 


S 2 = 


1 


n ~ 1 
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2 >, t -x) 2 = 

k= 1 


/ 


\ 


k= 1 





The sample variance is an unbiased estimator of a 2 . The sample standard deviation is defined 
as 


and is commonly used as an estimator for a. Nevertheless, .S’ is a biased estimator of a. 

You can use the mean command in MATLAB to compute the sample mean for a given sample. 
More specifically, for a given vector 
x = [jCj, x 2 , •••, x ], mean(x) returns the sample average 

X J + X 2 + • • • + X n 
n 

Also, the functions van and std can be used to compute the sample variance and the sample 
standard deviation respectively. 
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Example 8.6 

Let Tbe the time that is needed for a specific task in a factory to be completed. In order to 
estimate the mean and variance of T, we observe a random sample T l ,T 2 ,--,T 6 . Thus, T- s are 

i.i.d. and have the same distribution as T. We obtain the following values (in minutes): 

18,21, 17, 16, 24,20. 

Find the values of the sample mean, the sample variance, and the sample standard deviation 
for the observed sample. 

• Solution 

o The sample mean is 


T = 


t i + t 2 + t 3 + t 4 + t 5 + t 6 


18 + 21 + 17+ 16 + 24 + 20 


= 19.33 

The sample variance is given by 

1 


S 2 =— l Y J (T k - 19.333) 2 = 8. 


67 


k= 1 


Finally, the sample standard deviation is given by 

S=^S 2 = 2.94 


You can use the following MATLAB code to compute the above values: 
t = [18, 21, 17, 16, 24, 20]; 
m = mean(t); 
v = var(t); 
s = std(t); 

<— previous 
next —> 
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